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ABSTRACT 

Consider  the  standard  linear  model  Y^  =  x.Jb  +  e^ ,  i  =  l,...,n,...  where 
x^,  x2,  ...  are  assumed  to  be  known  p- vectors,  B  the  unknown  p- vector  of 
regression  coefficients,  and  e^ ,  e2,  ...  the  independent  random  error 
sequence  each  having  a  median  zero.  Define  the  Minimum  L^-Norm  estimator 
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inf^Ii=l lYi  ~ x]b| :  B  e  RP|.  It  is  proved  in  this  paper  that  gn  is 
asymptotically  normal  under  very  weak  conditions.  In  particular,  the 
condition  imposed  on  {x^}  is  exactly  the  same  which  ensuring  the  asymptotic 
normality  of  Least  Squares  estimate:  11mn(mmaXj  ;inxj(Ej-lxjxj^xi  =  0. 
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1.  INTRODUCTION  AND  SUMMARY 


Consider  the  standard  linear  regression  model 
Yi  =  2 i £o  +  ei *  1  =  1 . . 


(1.1) 


where  x^,  x2>  ...  are  assumed  to  be  known  p-vectors,  the  unknown  p- vector 
of  regression  coefficients,  and  e^ ,  e2,  ...  the  i.i.d.  random  errors  with  a 
common  density  function  f  and  median  zero,  f  is  continuous  at  0  and  f(0)  >  0 
The  Minimum  L^-Norm  ( ML ^ N )  estimate  of  j3q  is  defined  as  a  solution  of  the 


if  |Yi  -xj§nl  »  1nf{  j  |Yrxjj|:  6  e  Rp}. 


(1.2) 


Here  we  assume  that  the  parameter  space  is  the  whole  p-dimensional  Euclidean 
space  Rp.  It  will  be  indicated  (see  Corollary  3  below)  that  no  change  in 
the  argument  is  needed  when  Rp  is  replaced  by  any  of  its  subset  containing 
the  true  parameter  as  an  inner  point. 

The  ML-jN  estimate,  whose  usefulness  is  by  now  universally  recognized, 
dates  back  to  Laplace.  But  for  a  long  time  in  history  it°never  attracted 
much  attention.  One  reason  is  in  the  difficulty  of  its  computation,  which 
has  now  been  resolved  with  the  advent  of  modern  computing  facilities,  and 
the  paper  of  Charnes  et  at  (1955)  linking  the  computation  of  8p  to  the  solu¬ 
tion  of  a  linear  programming  problem.  Another  reason  is  the  lack  of  an  ade¬ 
quate  asymptotic  theory.  It  is  well  known  that  in  the  problem  of  estimating 
the  median  of  a  univariate  population,  the  sample  median  is  (under  certain 
conditions)  asymptotic  normal.  Motivated  by  this  simple  case,  write 


?„  ■  i:i 


(1.3) 


2 


it 

It  is  naturally  expected  that  ( — *■  means  convergence  in  distribution) 
2f(°)sy2(in  -  gQ)  -£•  N(0,  I  ) ,  asn^ 


0.4) 


under  reasonable  conditions.  Here  I  is  the  identity  matrix  of  order  p. 

~P 

The  first  attempt  to  give  a  proof  of  (1.4)  was  made  by  Bassett  et  al 
(1978).  They  assumed  that  {e^}  satisfies  the  conditions  stated  earlier,  the 
solution  of  (1.2)  is  unique  (a  condition  difficult  to  justify),  and  that 


S  /n  -*■  Qj  a  positive  definite  matrix, 
•n  * 


(1.5) 


Unfortunately  their  argument  contains  serious  mathematical  gaps  which  do  not 
seem  easy  to  resolve.  For  one  thing,  they  overlooked  the  fact  that  the  o ( 1 ) 
at  the  right-hand  side  of  the  equation  above  (3.10)  of  their  paper  should  be 
oh(l),  and  it  is  by  no  means  clear  that  the  convergence  (as  T  -*•  «)  0^(1)  0 

should  be  uniform  over  h  e  H.  Moreover,  the  assertion  (3.9)  is  not  generally 
valid.  A  simple  counter-example  is  (in  notations  of  their  paper): 

Yt  =  xt6  +  ut’  *  =  (S:  one-dimensional) 

x^  =  l/v7,  x?  =  1  +  v^/10,  x3  =  x^  =  ...  =  1 

u-j,  u2,  ...  i.i.d.,  u^  -  N(0 ,1 ) . 

It  is  easy  to  verify  that  all  conditions,  including  the  uniqueness  assumption 
and  nonlattice  condition,  are  satisfied.  But  it  can  easily  be  shown  that 

PT(zT(S,h)  e  C[0,1])  =  0 

for  h  =  1  (which  belongs  to  H  =  {1,2,3,  ...T})  when  T  =  2,4,6,...,  and  (3.9) 
breaks  down  (see  Appendix  1). 

Bloomfield  and  Steiger  (1903)  advanced  a  proof  of  (1.4)  under  the  assump¬ 
tion  that  x-j ,  x^,  ...  are  observations  of  a  random  vector  X  with  a  positive 


3 


definite  covariance  matrix,  and  (x-pY-j),  (x2,Y2),  ...  are  stationary  and 
ergodic.  Unfortunately  they  failed  to  notice  that  for  {gn(c)}  (defined  by 
(6)  on  p.45  of  their  book)  to  be  equicontinuous,  gp ( c )  must  be  defined  as 
X^_1  hn(ri  (£))  /n»  and  not  ["_ihn(r.  (c))  as  in  their  book.  But  if  9p(£)  is 
defined  as  lj=1hn ^ (c)) /n,  the  assertion  rf^D^c^)]  2f (0) C  on  p.47 

should  be  [D„(c  )]  2f(0)C,  and  one  can  only  obtain  c  -  a  +  0  in  probabil- 

ity,  not  the  crucial  assertion  (8)  on  p.46  of  their  book,  and  the  proof 
breaks  down.  Besides,  they  made  the  mistake  that  the  function  hn(t)  defined 
on  p.45  of  their  book  has  no  second  order  derivative  at  t  =  ±n~'3,  making 
the  relation  (12)  on  p.47  invalid. 

Meanwhile  Amemiya  (1982)  gave  a  proof  of  (1.4)  by  approximating  the 
absolute  value  function  with  a  twice-differentiable  one.  He  made,  in  addi¬ 
tion  to  (1.5),  the  assumption  that  {x^}  is  a  bounded  sequence,  and  that  Bq 
is  confined  in  a  compact  region.  Unfortunately  his  proof,  too,  is  invalid. 

One  problem  is  that  his  assertions  (in  notations  of  his  paper)  ->•  0  in 
(3.12)  and  B^  -*•  0  ^in  (3.22))  are  both  incorrect.  Quite  contrary,  we  have 
shown  by  simple  arguments  that  actually  A.|  -*■  »,  B-|  -*■  »  in  probability 
(see  Appendix  2).  Another  crucial  point  is  that  in  order  to  show 
T(fl  -  6*) '  (b  -  6*)  0  in  probability  as  T  -*•  »,  (3.11)  should  be  understood 

as  sup{  |S*(b)  -  S  ( 6 )  | :  6  €  0}  0  in  probability  (D  is  the  parameter  space), 

while  his  argument,  even  freed  from  the  error  indicated  above,  is  obviously 
not  sufficient  for  this. 

When  the  regression  model  contains  a  constant  term:  Y^  =  Qq  +  x!§q  +  e., 
i  =  1,2,...,  Bloomfield  and  Steiger  (1983,  p.62  Lemma  1)  noticed  the  interest¬ 
ing  fact  that  the  ML,N  estimator  8  of  Bn  is  in  fact  a  special  case  of  a  class 

i  ~n  ~u 

of  rank  estimators  introduced  by  Jaeckel  (1972).  Jackel  showed  that  his 


estimator  is  asymptotically  equivalent  to  an  estimator  introduced  by 
Jureckova  (1971).  From  this  a  proof  of  asymptotic  normality  of  the  ML^H 
estimator  §n  can  be  obtained  by  using  the  theorem  proved  by  Jureckova 
(1971).  However,  this  does  not  give  a  satisfactory  solution  of  the  problem 
for  the  following  two  reasons:  First,  Jureckova's  theorem  imposes  very  cum¬ 
bersome  conditions  on  the  sequence'  {x.}  which  are  difficult  to  verify.  Her 
theorem  also  requires  the  existence  of  Fisher  information  of  the  density  f 
of  the  error,  so  f  must  be  positive  and  absolute  continuous  on  R1 .  Even 
the  simple  uniform  distribution  R( -1 ,1 )  does  not  meet  this  condition. 
Second,  the  theorem  so  obtained  cannot  deal  with  the  case  of  (1.1)  in  which 
no  constant  term  is  present.  If  such  a  constant  is  present,  the  theorem 
.  cannot  deal  with  this  term. 

Dupacova  (1987)  proved  a  theorem  concerning  the  asymptotic  normality 
of  possible-constrained  ML-jN  estimates  in  case  that  {x.}  is  a  random 
sequence.  Her  theorem,  when  applied  to  the  unconstrained  case,  gives 
roughly  the  result  stated  by  Bloomfield  and  Steiger  (1983),  as  mentioned 
earlier.  There  is  a  mathematically  undesirable  condition  in  her  theorem: 
llx-jll  possesses  a  finite  moment  of  third  order. 

It  is  the  purpose  of  this  paper  to  give  a  rigorous  proof  of  (1.4) 
under  minimum  conditions.  First,  in  the  i.i.d.  case,  we  have  the  follow¬ 
ing  theorem: 

THEOREM  1.  Suppose  that  in  model  (1.1),  e^,e£,  ...  are  independent 
and  identically  distributed  with  a  common  distribution  function  F,  and  the 
following  two  conditions  are  satisfied. 

1.  There  exists  a  >  0  such  that  f(u)  =  F’(u)  exists  when  |uj  <  a,  f  is 
continuous  at  0,  f ( 0)  >  0  and  F(0)  =  1/2. 


(1.6) 


2.  Sn  is  nonsingular  for  some  n,  and 

•  _i 

lim  max  x.Sn  x.  =  0. 
n-*»  1  _<  i  _<  n  ~  ~1 

Then  (1.4)  is  true. 

Remark.  The  condition  (1.6)  is  exactly  the  same  as  that  which  guarantees 

the  asymptotic  normality  of  the  Least  Squares  estimate  of  g  in  case  that 

«• 

2 

{e.j}  is  i.i.d.  and  Ee^  =  0,  0  <  Ee-j  <  <=.  It  was  expected  that  the  conditions 
ensuring  the  asymptotic  normality  of  Minimum  L-j-Norm  estimate  might  be  more 
stringent  (as  compared  with  the  LS  case),  as  the  Minimum  L-j-Norm  estimate  is 
nonlinear  while  the  LS  estimate  is  linear. 

COROLLARY  1.  If  {e^}  is  i.i.d.,  condition  1  is  satisfied,  and  there 
exists  constant  sequence  (gn }  such  that  gn  -*•  <*>,  gn+-j/gn  -*  1,  and 

Sn/9n  ■+  A  Positive  definite.  (1.7) 

Then  (1.4)  is  true. 

Wu  (1981)  mentioned  this  condition  in  connection  with  the  problem  of 
consistency  of  LS  estimates. 

This  corollary  contains,  as  a  special  case,  the  result  stated  in  Bassett 
and  Koenker  (1978).  In  turn  it  implies  the  following  result: 

COROLLARY  2.  Suppose  that  {e..}  is  i.i.d.,  condition  1  is  satisfied, 
and  x.|,  X£,  ...  are  i.i.d.  observations  of  a  random  vector  X  such  that 
E(XX')  is  positive  definite,  { x . }  and  {e  }  are  independent.  Then  with  prob- 

— ~  l  i 

ability  one  (for  almost  every  sample  sequence  fx^l),  (1.4)  is  true. 

Of  course,  we  need  not  consider  (1.4)  as  a  conditional  statement:  it 
is  also  true  unconditionally.  Thus  we  reach  the  conclusion  stated  in 
Bloomfield  and  Steiger  (1983). 


Since  p  ,  >  0,  from  (1.9)  and  (1.10),  we  have  lim  p  .  =  and  (1.8)  is 
m  i  n-+»  n  I 

proved. 

From  Corollary  3  it  follows  that  if  we  use  a  subset  G  containing  p^  as 
an  inner  point  to  replace  Rp  in  (1.2),  and  denote  the  resulting  solution  by 
f n (G) ,  we  shall  have  P(ji  (G)  ^  H  )  -*•  0  as  n  *  -.  Hence  (1.4)  is  still  true 
if  §n  is  replaced  by  6  (G). 

In  passing  we  note  that  Y.  Wu  (1987)  proved  the  strong  consistency  of 

A 

gn  under  conditions  slightly  stronger  than  those  of  Theorem  1.  It  does  not 
seem  possible  to  give  a  proof  under  the  conditions  of  Theorem  1. 

In  practical  applications  there  is  usually  a  constant  term  in  the  re- 


gression  function,  and  instead  of  (1.1)  we  have  the  form 


vi  ■ 


?i-60  +  ei> 


1  l,...,n,.. 


(1.11) 


Although  (1.11),  as  a  special  case  of  (1.1),  can  be  dealt  with  by  Theorem  1, 
for  inference  purpose  it  will  be  convenient  to  have  a  theorem  formulated 
in  the  following  manner. 


THEOREM  2.  Write  (an,S|!))  the  Minimum  L.-Norm  estimate  of  (ciq,^),  and 


*»■  <V  — +in)/n*  In"  T/ii'JnXjrJn*' 


(1.12) 


Suppose  that  {e.}  is  i.i.d.,  condition  1  of  Theorem  1  is  satisfied,  Jn  is 
nonsingular  for  some  n,  and  that 


lim  max  (xi  -  xn)'J^1(xi  -  xn)  =  0. 
n-H»  1  <i  <n 


(1.13) 


Then  as  n  ->  we  have 

2f(0)In/2(®n  “  2o)  Ip}  (1J4) 


2f(0)/n 


/l  +nx’T"1xn 
-n-n  -n 


(“n-a0) 


f. 


N(0,1). 


(1  15) 


Also,  the  two  variables  2f(0)jJ/2(gn  -  gQ)  and  2f (0) /n{ ( ap  -  aQ)  +  x^(gn  -  gQ) > 
are  asymptotically  independent. 

We  note  that  the  weak  consistency  of  a  and  e  still  holds  true.  For 

J  n  cn 

the  assertion  follows  from  (1,14)  and  T~^  0,  which  is  a  consequence  of 

(1.13),  in  much  the  same  way  as  (1.8)  is  a  consequence  of  (1.6),  For  ap 

the  assertion  follows  from  (1.15)  and  "x'T'^x  -*  0,  which  is  a  trivial  cor,- 

-n^n  ~n 

sequence  of  (1,13)  and  the  fact  that  T~^  -*■ 


0. 
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Corollary  2  can  trivially  be  modified  to  accomodate  the  model  (1.11). 
All  we  need  to  do  is  to  replace  the  matrix  E(XX')  by  the  covariance  matrix 
E(X  -  EX) (X  -  EX) 1 . 

•m  %  *« 

The  above  two  theorems  can  easily  be  extended  to  the  case  that  e-j , 
eg.  ...  are  independent  but  not  necessarily  identically  distributed.  We 
shall  only  give  a  formulation  of  the  following  result. 


THEOREM  3.  Suppose  that  in  model  (1.1),  e.| ,  e^,  ...  are  independent, 
the  distribution  function  F^x)  is  differentiable  over  an  interval  (-a,a), 
F^O)  =  1/2,  i  =  1,2,...  and  a  >  0  does  not  depend  on  i.  Write  f^(x)  =  F!(x). 
Suppose  that  (f.(x)}  is  eauicontinuous  at  x  =  0  and  0  <  inf^f^O)  _< 
sup.jf.(0)  <  «.  Finally,  suppose  that  (1.6)  is  true.  Then  as  n  >  »,  we 
have 


-  So> 


N(0,  I). 

-  -p 


(1.16) 


Our  main  task  is  to  prove  Theorem  1.  Once  this  is  achieved,  only  some 
trivial  modifications  are  needed  in  proving  Theorem  3,  and  much  the  same  can 
be  said  about  Theorem  2.  To  prove  Theorem  1,  it  wiJl  be  found  convenient 


to  reformulate  the  original  problem  in  the  following  manner.  Write 


S’^^x  =  x  i  =  1  n 

rn  ~i  ~ni  ’  1  ■»•••» 


SV2B  =  6 
-n  ~0  -nO’ 


Y  .  =  Y., 
ni  i 


e  .  =  e. 

m  i 


Then  (1.1)  has  the  form 


pODTOTOWfk^WWfVyU^WWW1 TODWCWKWOOWW  7W  TV*  ’>  '■-*  \}*~mM~*.  v*  V*  w>  '."*  ,-v  Wwi  vu".' \  v  v  v  .c.  »■:  .v  77  -*~'v.v\ -wri."*  w* 
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Denote  by  ep  the  Minimum  l^-Norm  estimate  of  Bp0  in  (1.17).  Then  we  have  the 
following  general  theorem,  which  includes  Theorem  1  as  a  special  case, 

THEOREM  V.  Suppose  that  in  model  (1.17),  xnl ,  ....  xnn  are  known  p- 
vectors  satisfying  (1.18),  e^,  ...,  enn  are  i.i.d.  variables  whose  common 
distribution  function  F  does  not  depend  on  n  and  satisfies  the  condition  1 
of  Theorem  1.  Also,  assume  that 

d  =  max  ||xn.|!  ->0,  as  n  ->■  «  (1.19) 

1  <i<n 

where  || •  ||  is  the  Euclidean  norm  in  Rp.  Then  as  n  -*•  <*>  we  have 

2f(0) (|n  -  Bng)  N(O.Ip).  (1.20) 

This  theorem  will  be  proved  in  Section  3.  Part  of  the  reasoning  is 
contained  in  Section  2  in  the  form  of  several  preliminary  lemmas. 
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2.  SOME  LEMMAS 


LEMMA  1  (Bennett).  Suppose  that  ....  are  independent,  E^.  =  0, 
U-jl  <*><»,<  =  l,...,n,  where  b  is  a  constant.  Write  V  =  J?=1Var(s. )/n. 
Then  for  any  e  >  0,  we  have 


I  I  5,-/n|  >  e)  <  2exp(-ne2/(2V  +  2bE)Y 
i=l  ' 


(2.1) 


For  a  proof,  see  Bennett  (1962). 


In  model  (1.17),  we  can  assume  that 


-n0  "  2 


(2.2) 


without  any  loss  of  generality.  This  we  shall  always  do  in  the  sequel,  and 


we  have  Yni  =  e^.  For  any  vector  a  =  (a^ . ap)'  in  Rp,  write  |a|  = 

maxl<i<plai I •  will  be  used  to  denote  the  indicator  of  the  set  A. 


LEMMA  2.  Suppose  that  in  model  (1.17)  the  conditions  of  Theorem  1'  are 


satisfied,  with  the  possible  exception  of  (1.19).  Then  we  have 


‘(I  j,s9n(yni-!!nf6n>!Snll  i  <P+1>dn)  ±  1 0 6n I  i  (2-3) 


with  probability  one,  where  sgn(0)  =  0,  sgn(a)  =  a/|a|  for  a  f  0. 


Proof.  Since  by  definition  £n  is  a  minimization  point  of 


^i=ll^ni  ”xni-^  as  a  ^unc^10n  B  €  Rp,  for  any  unit  vector  e  e  Rp,  we  have 


by  taking  directional  derivative 


J,s9n<Yni  -  :Al€n)SnlSI(Yn1  ^  inl^n5  +  i  lini'l '<Yni  =  i)ni2n>  i  °' 
1=1  1=1 


This  implies,  in  view  of  the  arbitrariness  of  e,  that 


Now  suppose  that 


lgnl  <  A/(^dn).  If  1  <.  i-j  ^  <  ...  <  i  i  <_  n  and 


Ynk  =  ?nk!n’  k  =  il»,*“ip+r 


Then  find  real  constants  c-|,...,c  +-j  not  equaling  to  zero  simultaneously 
such  that  c1xnl^  +  ...  +  cp+1xni  =  0.  We  have 


C-.Y  .  +  ...  +  r  ,Y 


p+t  n1p+l 


(2.5) 


Considering  this,  and  the  fact  that  J B_ |  <  A/(/pd  )  implies  | x*  .6  I  <  a, 

~n  n  ~m~rv  - 


we  reach  the  following  conclusion: 


The  event  j  I- *n1  1 1  (Yni  =  x*.£n)  >  (p+l)dn>  and  |gn|  <  A/(^dn)j 

C  the  event  j  there  exists  constants  c, ,  . . . ,  c  ., 

l<ij<.Vr<:ip+1<n|  1  P+1 

depending  only  on  x  .  ,  ...,x  .  ,  not  all  zero,  such  that 

n1l  nip+l 


j=lCjVn,j 


0 ,  and  | Yni  1  <  A,  j  =  1 . p+lj  . 


(2,6) 


Since  Y^  =  ,  e^ ,  ...,enn  are  i.i.d.  variables  whose  common  distribution 

function  is  continuous  over  (-A,A),  it  is  seen  that  the  probability  of  the 
event  on  the  right-hand  side  of  (2.6)  is  zero.  This  fact,  taken  together 
with  (2.4),  implies  (2.3),  and  the  lemma  is  proved. 

In  order  to  introduce  the  crucial  Lemma  3,  we  have  to  define  some 
notations. 


By  (1.19)  it  follows  that  there  exists  constant  sequence  (pn)  of  positive 


even  integers  such  that  as  n  •*■  ®, 


r»vnfwvwwvwu>rv«w#j^^ 


yn  ■*•  •»  M^dn  ■*  °»  Jp  Updn  <  1/2 ,  for  n  large. 


(2.7) 


In  the  following  we  assume  that  the  conditions  of  Theorem  V  are  met.  Write 


M  =  Mn  =  [log  n/logun],  rm  =  yJJ,  m  =  1.....M,  rM+1  =  n. 


(2.8) 


-  0^  -{0  ( f?i  >  . .  •  >  0_ )  !  i  -  l».o.»p). 


(2.9) 


Note  that  M  >_  1  for  n  large.  This  is  true  because  by  (1.18)  we  have 
2  1/4 

n/41-  v  1  f hnyof >  n  *»  ”  Dam+itiftn  n  l’ntft  a  nnmkav  rtf  lnfanol 


ndn  >_  1 ,  therefore  yp  <  n 


Partition  D  into  a  number  of  intervals  D-j , 


D2,  ....  Dj  each  having  the  form  (x  =  (x^,  ....  x^)' :  a^  <_  x^  <  b^ , 
i  =  1 ,. . .  ,p) ,  such  that 

l(V±1’  D.  0  Oj  =  0,  i.j  =  l.....Jr  i  t  J 

where  L( A)  is  defined  as  sup{|u-v|:  u  e  A,  v  e  A).  Now  each  subset  is 

again  partitioned  into  a  number  of  disjoint  intervals  ,  D^,  ....  D^j 

which  can  further  be  parti oned.  This  process  is  defined  inductively  as 

follows:  Suppose  that  after  the  (m-l)-th  round  we  have  partitioned  D  into 

(D.  .  },  then  in  the  m-th  round  we  take  each  D.  .  and  partition 

Jr**Jm-i  „  Jr--Jm-i 

it  into  a  number  of  disjoint  intervals  (D-  •  . :  z  =  1,2,...,J  },  such 

Jr*-Vr  m 


J  <  yf,  L(D . 


m  -  n  ’  -  n 


*  -2(m-l)  .  .  . 

'  —  Mn  m 


(2.10) 


The  process  ends  with  the  completion  of  the  (M+l)-th  round.  Denote 


G  =  {D.  .  }  =  the  partitioning  of  D  after  the  m-th  round 

m 

m  1,2,..., 


M+l. 


(2.11) 


pwww ■v^jirv’.-y  -wwv  -v-*  -j  -y  r  y  -j  yn*  y  - 
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A  typical  element  of  Gm  is  denoted  by  B  or  D..  .  ,  and  a  chosen  point  in 


it  is  denoted  by  b  or  b.  .  .  Put 


Jl*-'Jm 


»„i(6)  =  sgrKe^-x’,?)  -  sgn(enj) 

(2.12) 

Xni(6)  -  2(F(?-,B)  -  1/2)  =  -E*n1(|) 

(2.13) 

V§>  =  *„i<s>  + 

(2.14) 

By  (2.7),  when  3  e  D  and  i  £  n,  we  have,  for  large  n, 

I ?ni? I  <  ^Pdnl§l  <  (2pn)_1  (2.15) 

and  for  b  e  B,  b*  e  B  where  B  e  Gn>  we  have,  for  large  n, 

l;;,(b-b*)|  <  ^pdn|b-b.|  <  tfgjjy1.  1  <  1  <  n.  (2.16) 

From  (2.15)  and  the  conditions  imposed  on  the  distribution  function  F  (see 
condition  1  of  Theorem  1),  it  follows  that  we  can  find  a  constant  C  >  1  such 
that  for  g  e  D  and  1  <  i  <  n: 


[x 1 . B  +  y~2m) 
lCm~  Mn  1 


*  i  „  -2nu 

l?ni!"pn  ^ 


m 


m 


satisfying 


Y  a6.  =  1,  la  .|  <  v 

•  ^  m  ’  1  rn '  -  Hn 


-m/2 


>  -2m 
<  Lu 
—  Mn 

(2.17) 

-£ 

(2.18) 

i  CdnV 

(2.19) 

!  a  triangular  array  of  real 

numbers 

> 

for  i  >  r  +  1 
—  m 

(2.20) 

,2 , . . .  ,M+1 . 

•  vi 

■>: 

ft  .  M 

■V 

Iji 

a 


f 


Define  the  following  quantities 


Qm  =  I  pjsuP  I  ani(tni  -tni(b))l  >  e3""i  » 
m  BeG„  8eB  i=r  +1  nn  ni  m  -  /  |  - 


1  _<  m  <_  M  +  1, 


(2.21) 


rm+l  . 

=  I  p  stJP  I  anl(tni(B)’ tni(b))  l^"(m+1)  =  £  U(m,B), 

BeGm  /eeB  i-r+1  '  n1  '  >  ~  beV 


1  <_  m  <_  M, 


(2.22) 


i  ^  i  P  1-r^  +ia,li ^tn1’ ^-Ji* ^ 
Jl’---’Jm+l  1  rm+l  1  1  m  1  m  m+1 

>  e3-(m+1)j 

jr.-.j|n+lV^m’jr***’Jm+l)»  1  1  m  1  M*  VH  =  I 


(2,23) 


In  the  above  expressions  e  e  (0,1),  r  ,  m  =  1,...,M,M+1,  has  been  defined 

in  (2.8).  Further,  in  the  definition  of  V  ,  the  summation  runs  over  all 

m 

such  (j.,  . ..,  )„.,)  that  D.  .  is  a  member  of  G  . , ,  and  b.  .  is 
I  m+i  3VJm+l  m+ 

understood  as  the  point  chosen  in  D.  .  e  Gf,  while  in  the  definition  of 

J  1  *  *  •  J  £  ^ 

Qm  and  Um,  b  is  the  point  chosen  in  a  member  B  of  Gm,  as  stated  earlier. 

By  these  definitions  it  is  easily  seen  that 


Q»,  i  Um  *  \  *  Qm+1  •  l"*1 . M' 


(2.24) 


Therefore,  on  noticing  that  QM+1  =  0,  we  have 


Qi  <  I  (Um  +  VJ. 
1  m=l  m  m 


(2.25) 


B 


P< 


Kp 

p*s 


K* 


T"'.'  V.  w  V 


LEMMA  3.  Suppose  that  the  conditions  of  Th°nrem  T  are  met,  and 
{an-}  satisfies  (2.20),  e  e  (0,1).  Then  when  n  is  large  we  have 

um  +  vm  1  4^pmexp{-48_1€(9‘1yn)m/2}»  m  =  1 ....  ,M  (2.26) 

where  C  is  the  constant  appearing  in  (2.17)-(2.19). 


Proof.  Def i ne 


2.  wi*"  le„i <  % 


nn1  -  "ni(B)  * 


0,  otherwise 


(2.27) 


where  B  e  Gm  and  b  is  the  point  chosen  in  B.  By  (2.14),  (2.16)  and  (2.17), 


we  have 


m+i  .  . 

=UE  ,  J  *n -  tnl<6)) 


8eBli=rm+l 


III”  I  II*  *  I 

i  1=r^+1,an1 1 ("nl  -  Ean) >  +  21=  I  Iani lE"nl • 


From  (2.16),  (2.17)  and  (2.27),  we  have 


(2.28) 


Enni  -  Cyn  *  1  =  rm+1 **** ,rm+l * 


(2.29) 


By  (2.20),  we  have  for  n  large 


2  X  lan1lEnnii2(iT;](Ennj)2y/2i2(ur,cV"’) 


<  2CjTm  <  e2'13‘(m+1). 


—  n 


(2.30) 


From  (2.1),  (2.28),  (2.30)  and  by  Lemma  1,  we  obtain 


U(m,B) 


I  m 

<p  2 

(li=r 


I  Jan.|(nni-Enni)  >  e2  '3 


1--(m+l)| 


<2expj-c22-23-(^2)/[( 


max  Eti„. 

V-rm+l 


+  e3~  (m+1 )  max  |  a  .  | 


(2.31) 


U(m,B)  Is  defined  in  (2.22).  From  (2.17),  (2.20)  and  (2.29),  we  have,  for 


large  n, 


max  |a  .  |  <  y^m/2, 
i>r 

m 


e  2  or  '2m  -m/2  _-(m+l) 

“x  ""1  1  !i  ' 
m  —  m+1 


Therefore 


U(m,B)  <  2exp(-e3'(m+1)j/"/2/16) 


(2.32) 


which  implies 


u  <  z^expi-cS-^'Wie) 


m  —  "n 


(2.33) 


By  (2.20), 


4  rcax  |aniU3-(m+,)  <  4c3-<m+1>u;(m+1)/2. 


(2.34) 


By  (2.16)  and  (2.17),  we  have,  on  denoting  g  .  =  t  , (b.  .  ) 

n  1  n  1  ~  j 


W-i  i 

n  JVJm+l 


Eg2.  <  4)  F(x '  .b.  .  )  -  F  ( x 1 .  b  .  .  ) 

m  -  I  -n1~JvV  'nKJr  ”Jm+l 


<  2Cp"2m  <  4e3~(m+^)y”(rn+l)/2. 
-  Mn  -  Hn 


(2.35) 


ek 


v 

A; 


?! 


a 

pfc*' 


$ 


.%W  „N| 


In  order  to  do  this,  we  partition  the  interval  D  in  exactly  the  same  manner 
as  we  have  previously  done  for  D  defined  by  (2.9),  with  yn  replaced  by  vn> 
and  that  instead  of  (2.8),  we  now  define  M  as  a  positive  integer  satisfying 


W"  <  Z-'eS-^'V 


(2;39) 


where  e  >  0  is  an  arbitrarily  given  constant.  The  existence  of  such  M  follows 

from  the  fact  that  vp  -*■  =».  Also,  the  partitioning  of  D  after  the  m-th  round 

will  be  denoted  by  Gm.  A  typical  interval  belonging  to  G^  will  be  denoted 

by  B  or  B.  .  ,  and  a  point  selected  from  it  by  b  or  b.  .  .  Define 

Jr***’Jm  -  “Jr°*°’Jm 

VW  p(llRnlV'  -eV"/3) 

J  l  I 

v"  ’  J1.»-Jm+,P^,^Rnl(bii— •’m1  "  Rn,<-jr-Vi^  -ev"/3'"  ’) 


1 ,2,. » 8  ,M, 


I  m  - 

m  =  1,2,.,. ,M+1 „ 

Note  that  for  any  B  €  GM  .  and  B  e  B  we  have,  in  view  of  (2.39), 


j,ki<e> -*„,<!!>) I  <e-s>i  i("j,fe,(5-t:)2)1/2 

=  ^llg-bi|  <  /np  |b  -  b|  <  /RFv'2M  <  2'1E3'(M+1)v2. 

1  -  *• 1  —  n  n 


Hence 


<<3' 


(M+l )  2 
vn 


which  implies  that 


=  0. 


It  is  easy  to  see  that 


Qm  -  Vm  +  Qm+1’  m  =  » 


(vi 


From  (2.40)-(2.42) ,  it  follows  that 


P(vr20sue|  j1Rni(6-)|  "nV 

8eD  1=1  m=0 


Since 


<  1*^1  i  V„i*n 


-1  2 
,  since  d  v  <  1 
n  n  — 


i^Var(*ni{^j1))  —  ^vn* 

*m^jr ^ -ni^-Ji  — Jm  “jr-.j 

1  II 5ni  HI £i  j 
ni  J1 


<  plx  .  |  •  |  b  .  .  -  b . 

“  *n1  ~Jr 


_j  -2m+2  „  -2m 
<  pd.v  <  pv  , 
—  r  n  n  -  y  n 


and 


V^Y 


J1''aK*„1<&j1...Jrn)-*n1^jl...Jm+1>)  ^  1IIC^i^j1...jtn-^|.-.jm+1)) 


c  p I b .  ,-k.  .  | 2 


<  DV 

-  r  n 


Applying  Lemma  1,  we  get 
<  2v^^exD^-2 


V0  i  Zv^exp^’VvJ/Upvjjtevj)  1  2v^pexp(-cv^).  (2.44) 


5m  i  2vf 1 ( m*1  > ex p< -  ( c 3-"- 1 ) 2/  ( Zpv;4"^4  +  e 3-m- 1  p v;2m+  2 ) } 
^  2v2p(m+1)exp(_c3-(m*l)v2m+2)>  m  ,  , 


(2.45) 


Here  c  >  0  is  a  constant  independent  of  n,  m.  From  (2.43)- (2.45) ,  we  obtain 


P(v^2sup  |  l  Rni(e)|  >.)  <2  1  v2p(m+1)exp(-c3"(m+1)v2m+2). 
BeD  1=1  ”  m=0 


(2,46) 


Since  the  right-hand  side  of  (2.46)  tends  to  zero  as  n  -*•  <*>,  we  obtain  (2,38) 


Now  we  note  that 


sup  max  x  .6  <  d  v  <  v„ 

„  K  ,  .  1  -m- 1  —  n  n  -  n 

eeu  1 < i <  n 


-*•  0,  as  n 


(2,47) 


Hence,  considering  condition  1  of  Theorem  1,  we  have 


E4m<!>  = 


’ (2u  -  2xVe)f(u)du, 


f° 

,(2x\b  -  2u)f(u)du, 

W  “ni‘ 


x'  .6  >  0 

-m -  - 


ini®  <  0 


■  -flOHx^gj^i+oO)) 


(2.48) 


c, 


-+•  00 


for  b  e  D,  i  =  l,...,m,  n  sufficiently  large,  where  o(l)  +  0  as  n 
uniformly  for  B  e  D  andl  <  i  <  n,  in  view  of  (2.47).  From  (2.38)  and 
(2.48) ,  we  get 


lim  v"‘  sud 


.X  n eni  I  “  leni  ~  5ni2 ^  +  ‘F(°)II§II2  =  in  probability 


which  implies  that 


o  n 

r2(  i 

"  N=i 


-  inf 
lel-v. 


5il8l)  <-f(0)(l  ♦„  <!)) 


(2.49) 


where  on(l)  tends  to  zero  in  probability  as  n  >  •».  Since  T?  Je  .  -x'.e 
h  =  1 1  m  ~ni  - 

as  a  function  of  §  is  convex  over  Rp,  (2.49)  implies 


,?leni-5ii!l)  <-f(0)(uo  (D). 


Since  f(0)  >  0,  (2.50)  implies  (2.37),  and  Lemma  4  is  proved. 


(2,50) 


°  Remark.  Let  us  return  temporarily  to  model  (1.1)  and -consider  the 
ML-jN  estimate  gn  defined  by  (1.2).  It  follows  easily  from  Lemma  4  and  (1.8) 
that  Bn  is  a  weakly  consistent  estimate  of  Bq.  For  convenience  of  presenta¬ 
tion  we  formulated  this  fact  as  a  corollary  of  Theorem  1.  Now  we  see  that 
the  verification  of  this  fact  is,  in  fact,  an  important  step  in  the  proof 
of  Theorem  1 . 


c-  j'g  X'm  |V .<> '  .Hj1!  i1!  t  tj'lj' tj  *j M  J.*iL*i«/i 


3.  PROOF  OF  THE  THEOREMS 


V7>7v7CT;/;wwv.7<y;.^ 
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Proof  of  Theorem  1.  As  mentioned  earlier,  we  need  only  to  prove 
Theorem  l1.  We  begin  with  verifying  that  as  n  -*■  ®, 

n 

sup(||  J  ^i^^niH  :  3  6  D)  0,  in  probability.  (3.1) 

D  and  tn_j(e)  were  defined  by  (2.9)  and  (2.12)-(2.14) ,  respectively.  In 
view  of  (1.18),  we  need  only  to  prove  that  as  n 


Q0  =  P(sup{  |  .i  an1tni(8)| :  6  e  D)  >_  e)  -►  0 


(3.2) 


where  { ani }  is  an  arbitrarily  given  constant  array  satisfying  [?_^a^.  =  1, 
and  e  e  (0,1),  also  arbitrarily  given. 

Without  loss  of  generality,  assume  ja^j  >_  ...  >  ) anp ) .  Choose  pn  and 
rm  according  to  (2.7)  and  (2.8),  then  (2.20)  holds  obviously.  Define  Um> 
Vm,  Qm  by  (2.21  )-(2. 23 )  for  1  <_  m  <_  M,  and 


acH  1 ~ I 


V»=,BJa1P  ic/3’ 


(3.3) 


(3.4) 


with  b  e  B.  It  is  easily  verified  that  with  Qq,  Uq,  Vq  defined  by  (3.2)- 
(3.4),  (2.24)  also  holds  true  for  m  =  0.  Hence 


«0  i  U0  +  V0  *  Q1  i  L(Um  +  Vm) 

m=u 

Here  M  is  defined  by  (2.8).  Now  we  show  that 


(3.5) 


a 

k 

& 


**»  i 

f 

i 

8 

I 

8 


K',* 

w 


v 


Uq  +  Vq  0,  as  n 


(3.6) 


By  (2.18),  for  6  e  D,  we  have 


i^Vni^l  i(j11nf(5))1/2i("n(C/>‘n>2) 


,2,1/2 


0,  as  n 


(3.7) 


Further,  ^.(g)  =  0  when  §  e  D  and  |eni|  >  /pdnyn(>  \x\r\)  ,  here  ^.(g)  is 
defined  by  (2.12).  From  (2.7),  (2.19)  and  (3.7),  we  obtain 


rl  rl 

U0  1  *  °>  5  i/deJi'P  Vn>  < 


(3.8) 


Using  (2.17)-(2.20)  and  employing  the  argument  for  proving  (2.36),  it  can  be 


shown  that  there  exists  constant  C,  >  0  such  that 


VQ  1  2^pexp(-C1py2)  -  0, 


as  n 


(3.9) 


Now  (3.6)  follows  from  (3.8)  and  (3.9).  Further,  by  Lemma  3,  we  have,  for  n 


large. 


I  <tJn,  +  Vn1>  I  Mfmexp(-48-1e(pn/9)m/^. 
n=l  m=l  " 


(3.10) 


Since  the  function  x^pmexp{-a(^-)m//2} ,  x  >  0,  attains  its  maximum  at 
x  =  b(6p/a)2/,m,  we  have 

<  u-p"93pm(96p/c)6pe'6p  <  („n/729)pm(96p/E)1 
Hence,  noting  that  yn  we  obtain 


I  <VV±<^>6Pi>n/729>' 


(3,12) 


$ 

Si 


From  (3.5),  (3.6)  and  (3.12),  (3.2)  follows.  This  concludes  the  proof  of 
(3.1). 


By  (3.1)  we  have 


T  t  .  (b  )x  . 
.j“-j  m  ,n'  m 


I(B_€D)  0  in  probability,  as  n 

*  n 


(3.13) 


and  (2.7)  gives,  for  n  large,  A/(/pdn)  >  un  (a  is  the  number  appearing  in 
condition  1  of  Theorem  1).  So  by  Lemma  2  we  have 


[(|  J1sgn^ 


eni  ~  ?nifn^ni 


1  ( p+1 ) dn )l ( 1 6p |  <un)  =  0,  a.s. 


Hence  by  (1.19) 


1J15gn(en1  -^iln^nl  I(I*J  <vn]  *  (p+1)dn  *  °*  a*5’  (3*14) 


From  (2.12)-(2.14) ,  (3.13)  and  (3,14),  we  have 


J  sgn(e„,)x 
1  =  1 


ni  -m 


"Ifni 


0,  in  probability.  (3.15) 


Since  sup{ | | :  |bJ  <  un»  1  1  i  <.  n)  _<  pndn  <_  p2dn  -»•  0,  we  have 
*ni(£n)  =  2f(0)(1+op(1))?ni!n  on  account  of  condition  1  of  Theorem  1, 
where  op(l)  -*■  0  in  probability  as  n  -*■  °°  uniformly  for  1  ^  i  _<  n.  From  this, 
(1.18)  and  (3.15),  we  have 


2f(°)fn-T]s9n(en1);n1 


"Ifni 


<pn> 


-*■  0,  in  probability. 


(3.16) 


In  view  of  (1.18),  (1.19)  and  the  assumptions  on  {epi),  it  follows  by 
Lindeberg's  theorem  that 
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j,s9n(eni>i!m  (3-17) 

From  (3.16),  (3.17)  and  Lemma  4,  we  obtain  (1.20)  (^notice  (2.2)).  This 
concludes  the  proof  of  Theorem  V,  hence  Theorem  1. 

Proof  of  Theorem  3.  The  proof  differs  from  the  above  argument  only  in 
some  minor  details,  therefore  omitted. 


Proof  of  Theorem  2. 


8  =  T1/26 

2n0  In  60* 


Define  Tn  by  (1.12),  and 
an0  =  ^a0  +  Fn?0^’  InO  =  ^anO’^nO^’ 


T“1/2(x  -x  ) 
In  -i  Cn'* 


—  1  )  a  •  o  j  n  j 


zni  =  (l//n,  x;.)',  i  =  1 . n. 

We  transform  the  model  (1.11)  into  the  following  form: 

V1  =  ?ni2n0  +  e1*  i  =  1 . 

Since  I"=]JnjzV  =  I  ,,  and  (1.13)  guarantees  that  max, <j<n|znj I  -  0  as 
n  -v  «,  Theorem  T  can  be  applied,  and  we  obtain 

2f(0)(yn-Yn0)  -4  N(0,  Ip+]),  as  n  -►  «  (3.18) 

where  =  ( anQ •  ) 1 »  and  ang«  ^nQ  are  the  Minimum  L^-Norm  estimates  of 

ano»  §nQ,  respectively.  Now  (3.18)  implies  the  assertion  (1.14)  and  also 
the  asymptotical  independence  of  an0  and  6nQ.  Finally,  (1.15)  follows  easily 
from  what  has  already  been  proved.  Theorem  2  is  proved. 


Remark.  Wu  (1987)  proved  that  in  model  (1.1)  the  ML^N  estimate  §n  is 
strong  consistent  if  the  following  conditions  are  met: 


1°.  {e..}  satisfies  the  condition  stated  in  Theorem  3. 

2°.  Define  dn  =  max(l,  ||  x^||  ,...»||xn|| )  and  pn  =  the  smallest 

2  c 

eigenvalue  of  Sp,  then  pn/(dnlogn)  -+■  dp/n  -*■  o  for  some  c  >  0. 

At  one  time  it  was  expected  that  if  condition  1°  is  replaced  by 

T:  e-j ,  je^.  ...  are  i.i.d.  and  e^  has  a  unique  median  0,  the  conclusion 

of  Theorem  1  is  still  true.  The  motivation  behind  this  conjecture  is  the 
simple  case  of  estimating  a  population  median  by  the  sample  median,  in 
which  the  uniqueness  of  the  population  median  is  enough  for  consistency. 

Yet  the  following  example  shows  that  this  is  not  true: 

Example.  In  model  (1.1)  take  p  =  1  (b  is  one-dimensional),  x^  = 
logn/^n,  n  =  1,2,3,...,  e^ ,  e ...  are  i.i.d.,  e^  has  a  density  function 
f(u)  =  ] u 1 1 ( | u |  <1).  Here  d  =  1»  p_  =  S  ~  1 og  n)2.  Hence  condition  2° 

•I  II  ll  J 

is  fulfilled. 

In  this  example  all  conditions  of  Theorem  1,  except  that  f ( 0 )  >  0,  are 

met.  In  the  course  of  proving  Theorem  V  we  have  already  shown  this  (see 

(3.15)).  (Note  that  in  proving  (3.15)  we  made  no  use  of  f ( 0 )  >  0.) 

- 1  D  n  /in 

p'  I  sgn(e.)x.  -  2  Y  x.  f(u)du  I(|§J  <  1)  -*■  0,  in  probability.  (3.19) 
n  i  =  l  1  1  i  =  l  ^0  n 

Now  if 

Bp  0,  in  probability,.  (3.20) 

then  since  {x^ }  is  bounded,  from  (3.19)  and  f(u)  =  ju|I(ju|  <1)  we  have 

Pp1/2  I  sgn(e.)x.  -  Pp1/26p  ^  xj  ■>  0,  in  probability.  (3.21) 

1  r>  ft 

But  by  Lindeberg's  theorem  we  have  /pT"  £ i  =  1sgn(e..  )x  .  N ( 0 , 1 ) ,  whil 

Ii  =  lxi  bounded  in  n,  pp  -*•  «  and  b  0  in  probability.  Thus  (3.21)  is 
impossible,  which  in  turn  implies  that  (3.20)  is  impossible. 
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APPENDIX  1 

In  this  appendix,  all  notations  and  numbering  of  formula  are  according 
to  Bassett  and  Koenker  (1978),  if  not  defined  here. 

Consider  the  model 

yt  =  xts  +  ut*  *  =  1  »•  •  • 

where  b,  one-dimensional,  is  the  unknown  parameter,  ,  u^,  ...  are  indepen¬ 
dent  random  errors  with  a  common  distribution  N(0,1),  and 

Xj  =  1//?,  x2  =  1  +  ^/10,  X3  =  x4  =  ...  =  1. 

First  we  verify  that  the  minimum  point  of  the  function 

T 

J(e)  =  l  |yt  -  x.b| 

t=l  1  z 

is  unique.  For  if  this  is  not  true,  then  owing  to  the  convexity  of  J(b), 
there  would  exist  an  interval  [a,b],  -»  <  a  <  b  <  >»,  such  that  J(c)  = 
i nf{0 (B ) :  -~  <  B  <  »)  for  each  c  e  (a,b).  Choose  a  point  r  e  (a,b), 
r  f  t  =  1,...,T.  We  should  have  J'(r)  =  0,  i.e., 

-  — sgn(y1  -  r//7)  -  (1  +^)sgn(y2  -  (1  +  y|)r)  -  \  sgn(yt  -  r)  =  0. 

t — 3 

But  this  is  impossible,  since  the  sum  of  the  first  two  terms  is  an  irrational 
number  while  the  third  term  is  an  integer.  This  proves  the  uniqueness  stated 
above. 

Now  in  this  model  H  =  { 1 ,2,3, . . .  ,T) .  By  the  choice  of  { x^} ,  the 
distribution  of  ZT(6,T)  is  nonlattice.  So  according  to  (3.9),  we  should  have 

lim  T1/,2Pr(zT(<5 ,1 )  e  C[0,1  exists  and  not  zero.  (*) 

T-*«  ! 


But 


ZT(S,1)  =  (^+^)sgn(u2-T'1/20 +(|)6)  +  j  /2sgn(ut  -  T“  1/2S) . 

t—  3 

Therefore,  when  T  is  even  the  right-hand  side,  with  probability  one,  equals 
to  an  odd  multiple  of  /2  plus  ±1/5,  which  is  always  outside  [-1,1].  Conse¬ 


quently  we  have 


p(ZT(<S,U  eC[0,l])  =  0,  for  T*  2,4,6,... 


and  (*)  breaks  down. 
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APPENDIX  2 


In  this  appendix,  all  notations  and  numbering  of  formula  are  according 
to  Amemiya  (1982)  if  not  defined  here. 

1.  Denote  by  A-j ( Sq)  the  value  of  A-j  in  (3.12)  taken  at  the  true 
parameter  point  6g.  We  shall  proceed  to  show  that  A^Bq)  •*  «  in  probability 
as  T  -*■  ». 

Define 

1 »  "i ^  |yt  “ 

0,  otherwise. 

Since  Cy  =  Td,  ^  <  d  <  it  follows  that  Cy1  <  T’1Z3.  Hence  Zt  >_  ct»  and 

ai(?o}  =  2ct  i1ztiog(i+e  ) 


-Cf |y 


>  2C;‘  l  5tlog(l+e 
'  t=l  1 


>  2C*1  log(l  +e-1)  J  U  =  A. 
1  t=l  1 


Since  yt  -  x^BQ  =  ufc,  t  =  1,2,...  are  independent  and  identically  distributed 
with  a  common  density  function  f  which  is  continuous  and  f(0)  >  0,  it  follows 
that  there  exist  two  positive  constants  h^  and  h^  not  depending  on  t,  such 
that 

h^Cy  ^_P(?£=1)  h^Cy  *  t  =  1 ,2 , . . .  . 

Therefore  we  obtain 

E(A)  >  2  log(l  +e"1)h]TCy2  ->  ». 

Here  we  used  the  fact  that  Cy  =  Td  and  1/3  <  d  <  1/2.  Further 


Var(A)  <  (2C;1  log(l+e-1))2  f  E?2 

t=l  1 

<  (2C*1  logd+e'1))2^^1 
-  4h2  log2(l  +  e"1  )T/Cj  +  0 

by  the  definition  of  CT  given  above.  From  E(A)  -►  »  and  Var(A)  0,  we  have 
A  -*•  -  in  probability  as  T  +  «.  Since  A^Bq)  >  A,  we  obtain  A^eJ  -►  -  in 
probability  as  T  ®. 


2.  Denote  by  B^Bq)  the  value  of  B1  (in  (3.22))  taken  at  the  true 
parameter  point  §Q.  We  shall  now  show  that  B-j  ( Bq)  -*•  «  in  probability  as 
T  -*■  «. 

Define  as  before.  Since  yt  -  x^Q  =  ut,  we  have 


B1(B0)  =  rV  l  1  1/3  1  IVGnK 

t*l  (|ut|<T1/3cp  ‘  0  1 


where  M  =  sup{|x^|:  t  =  1,2,...}  <  *  by  assumption.  We  have 

E(B)  -  M(e+1)  T  1/2hlCT1  xit 

.  1__.  Tl/2r-l  1  r  2 

"  M(e+1)  hlT  CT  T  tt-|  Xit  as  T  ^  "• 

T  2 

This  is  because  £t_-|X.jt/T  tends  to  a  positive  limit  as  T 
and  that  C-j.  =  T^,  1/3  <  d  <  1/2.  Further 


->  oo 


(by  assumption), 


Var(B)  <  M  ‘(e+D'^r1  ^  |x.tf  E5‘ 

<M2(e+l)"2T"1  J  id 
t=l  t 

_<  M2(e+1)  2T  ThgCy^  -*■  0,  as  T  ->■  ». 

«  and  Var(B)  4  0,  we  have  B  -»•  ® 
the  same  is  true  for  B^(Bq). 


in  probability  as  T 


